Tackling Overfitting in Evolutionary-Driven Financial Model Induction
Abstract
This chapter explores the issue of overfitting in grammar-based Genetic Programming. Tools such as Genetic Programming are well suited to problems in finance where we seek to learn or induce a model from the data. Models that overfit the data on which they are trained generalise poorly, and generalisation is an important goal of learning algorithms. Early stopping is a technique that is frequently used to counteract overfitting, but it often fails to identify the optimal point at which to stop training. In this chapter, we implement four classes of stopping criteria, which attempt to stop training when the generalisation of the evolved model is maximised. In this way, we hope to increase the generalisation of trading rules in buy/sell prediction problems. We show promising results using, in particular, one novel class of criteria, which measured the correlation between the training and validation fitness at each generation. These criteria determined whether or not to stop training depending on the measurement of this correlation. They had a high probability of being the best among a suite of potential criteria to be used during a run. This meant that they often found the lowest validation set error for the entire run, and did so faster than other criteria.
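As a rough illustration of the correlation-based idea described in the abstract, the following minimal sketch stops a run when the correlation between training and validation error over a trailing window of generations falls below a threshold. The abstract does not give the actual criteria, so everything here is an assumption made for illustration: the function names `pearson` and `stop_generation`, the use of Pearson correlation, the window of 5 generations, the threshold of 0.0, and the treatment of fitness as an error to be minimised.

```python
from math import sqrt
from typing import Optional, Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation of two equal-length sequences.

    Returns 0.0 when either sequence is constant (correlation undefined).
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def stop_generation(
    train_error: Sequence[float],
    valid_error: Sequence[float],
    window: int = 5,        # assumed window length, not from the source
    threshold: float = 0.0  # assumed cut-off, not from the source
) -> Optional[int]:
    """Return the first generation (0-based) at which the correlation between
    training and validation error over the trailing window drops below
    `threshold`, or None if that never happens.
    """
    for gen in range(window - 1, len(train_error)):
        lo = gen - window + 1
        r = pearson(train_error[lo:gen + 1], valid_error[lo:gen + 1])
        if r < threshold:
            return gen
    return None


# Hypothetical per-generation errors: training keeps improving while
# validation error turns upward after generation 4.
train = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
valid = [10, 9, 8, 7, 6, 6.5, 7, 7.5, 8, 8.5]
stopped_at = stop_generation(train, valid)
```

A trailing window is used in this sketch so that the decision reflects recent behaviour: while both errors fall together the correlation stays positive, and it turns negative once training error keeps falling but validation error rises, which is the usual signature of overfitting.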
Similar resources
A Preliminary Investigation of Overfitting in Evolutionary Driven Model Induction: Implications for Financial Modelling
This paper investigates the effects of early stopping as a method to counteract overfitting in evolutionary data modelling using Genetic Programming. Early stopping has been proposed as a method to avoid model overtraining, which has been shown to lead to a significant degradation of out-of-sample performance. If we assume some sort of performance metric maximisation, the most widely used early...
An Approach to Reducing Overfitting in FCM with Evolutionary Optimization
Fuzzy clustering methods are conveniently employed in constructing a fuzzy model of a system, but they need to tune some parameters. In this research, FCM is chosen for fuzzy clustering. Parameters such as the number of clusters and the value of fuzzifier significantly influence the extent of generalization of the fuzzy model. These two parameters require tuning to reduce the overfitting in the...
Fuzzy Logic Based Life Estimation of PWM Driven Induction Motors
Pulse-width modulated (PWM) adjustable frequency drives (AFDs) are extensively used in industries for control of induction motors. It has led to significant advantages in terms of the performance, size, and efficiency but the output voltage waveform no longer remains sinusoidal. Hence, overshoots, high rate of rise, harmonics and transients are observed in the voltage wave. They increase voltag...
Selection of energy source and evolutionary stable strategies for power plants under financial intervention of government
Currently, many socially responsible governments adopt economic incentives and deterrents to manage environmental impacts of electricity suppliers. Considering the Stackelberg leadership of the government, the government’s role in the competition of power plants in an electricity market is investigated. A one-population evolutionary game model of power plants is developed to study how their pro...
Iterative Rule Segmentation under Minimum Description Length for Unsupervised Transduction Grammar Induction
We argue that for purely incremental unsupervised learning of phrasal inversion transduction grammars, a minimum description length driven, iterative top-down rule segmentation approach that is the polar opposite of Saers, Addanki, and Wu’s previous 2012 bottom-up iterative rule chunking model yields significantly better translation accuracy and grammar parsimony. We still aim for unsupervised ...
Publication date: 2012